Abstract

We investigate the strength and the direction of information transfer in the U.S.
stock market between the composite stock price index and the prices of
individual stocks using the transfer entropy. From the directionality of the
information transfer, we find that individual stocks are influenced by the
market index.
1 Introduction



Recently, the economy has become an active research area for physicists.
Physicists have attempted to apply the concepts and methods of statistical
physics, such as the correlation function, multifractals, spin models, complex
networks, and information theory, to study economic problems
[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17].

Many empirical data reflecting economic conditions can be obtained from the
economic system. Among them, the time series of the composite stock price
index is one of the data sets that best reflect economic conditions. The index
data are used to analyze and predict the prospects of markets. The scientific
interest in studying financial markets stems from the fact that there is a large
amount of reasonably well-defined data.
Information is an important keyword in analyzing the market or in estimating
the stock price of a given company. It is quantified in rigorous mathematical
terms [T^], and the mutual information, for example, appears as a meaningful
choice to replace a simple linear correlation, even though it still does not
specify the direction. Directionality, however, is required to discriminate the
more influential of two correlated participants, and it can be detected by the
transfer entropy [20].

In many cases, traders in the stock market refer to the index when investing in
stocks. Therefore, we can guess that the prices of stocks are affected by the
composite stock index of the market. However, no attempt to measure the
influence of the index quantitatively has been made, although many previous
econophysics studies of financial time series have found that the interaction
therein is highly nonlinear, unstable, and long-ranged. Schreiber [20]
introduced the transfer entropy, which measures the dependency in time between
two variables. We quantify the direction of information flow between the index
data and the prices of individual companies using the transfer entropy. The
transfer entropy has already been applied to the analysis of financial time
series by Marschinski and Kantz [21]. They calculated the information flow
between the Dow Jones and DAX stock indexes and obtained conclusions consistent
with empirical observations. While they examined interactions between two huge
markets, we examine the internal structure of a single market, between the
stock index and individual stocks.



2 Transfer entropy 



The transfer entropy, which measures the directionality of variables with
respect to time, was recently introduced by Schreiber [20] based on the
probability density function (PDF). Let us consider two discrete and stationary
processes, $I$ and $J$. The transfer entropy relating $k$ previous samples of
process $I$ and $l$ previous samples of process $J$ is defined as follows:

$$ T_{J \to I} = \sum p(i_{t+1}, i_t^{(k)}, j_t^{(l)}) \log \frac{p(i_{t+1} \mid i_t^{(k)}, j_t^{(l)})}{p(i_{t+1} \mid i_t^{(k)})}, \tag{1} $$



where $i_t$ and $j_t$ represent the discrete states at time $t$ of $I$ and $J$,
respectively, and $i_t^{(k)}$ and $j_t^{(l)}$ denote the $k$- and
$l$-dimensional delay vectors of the two time series $I$ and $J$, respectively.
The joint PDF $p(i_{t+1}, i_t^{(k)}, j_t^{(l)})$ is the probability that the
combination of $i_{t+1}$, $i_t^{(k)}$, and $j_t^{(l)}$ takes particular values.
The conditional PDFs $p(i_{t+1} \mid i_t^{(k)}, j_t^{(l)})$ and
$p(i_{t+1} \mid i_t^{(k)})$ are the probabilities that $i_{t+1}$ has a
particular value when the previous samples $i_t^{(k)}$ and $j_t^{(l)}$ are both
known and when only $i_t^{(k)}$ is known, respectively.
The transfer entropy with index $J \to I$ measures how much the dynamics
of process $J$ influences the transition probabilities of another process $I$.
The reverse dependency is calculated by exchanging $i$ and $j$ in the joint and
conditional PDFs. The transfer entropy is explicitly asymmetric under the
exchange of $i_t$ and $j_t$. It can thus give information about the direction
of interaction between two time series.
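The definition in Eq. (1) can be illustrated with a minimal sketch for the case $k = l = 1$. The function name `transfer_entropy` and the toy series are illustrative assumptions, not taken from the paper; probabilities are estimated as plain relative frequencies with no bias correction, and the logarithm is taken to base 2.

```python
from collections import Counter
from math import log2


def transfer_entropy(target, source):
    """Estimate T_{source -> target} in bits for k = l = 1.

    Both arguments are equal-length sequences of discrete states.
    Probabilities are relative frequencies (an assumption of this sketch).
    """
    if len(target) != len(source):
        raise ValueError("series must have equal length")
    n = len(target) - 1  # number of (i_{t+1}, i_t, j_t) transitions
    if n < 1:
        raise ValueError("need at least two samples")

    triples = Counter()   # counts of (i_{t+1}, i_t, j_t)
    pairs_ij = Counter()  # counts of (i_t, j_t)
    pairs_ii = Counter()  # counts of (i_{t+1}, i_t)
    singles = Counter()   # counts of i_t
    for t in range(n):
        nxt, cur, src = target[t + 1], target[t], source[t]
        triples[(nxt, cur, src)] += 1
        pairs_ij[(cur, src)] += 1
        pairs_ii[(nxt, cur)] += 1
        singles[cur] += 1

    te = 0.0
    for (nxt, cur, src), count in triples.items():
        p_joint = count / n                                # p(i_{t+1}, i_t, j_t)
        p_cond_both = count / pairs_ij[(cur, src)]         # p(i_{t+1} | i_t, j_t)
        p_cond_own = pairs_ii[(nxt, cur)] / singles[cur]   # p(i_{t+1} | i_t)
        te += p_joint * log2(p_cond_both / p_cond_own)
    return te


# Toy example: I copies J with a one-step delay, so J drives I.
j_series = [0, 0, 1, 1, 0, 0, 1, 1, 0]
i_series = [0] + j_series[:-1]
print("T(J -> I) =", transfer_entropy(i_series, j_series))
print("T(I -> J) =", transfer_entropy(j_series, i_series))
```

In this toy example the estimate for $J \to I$ is larger than the one for $I \to J$, which is the asymmetry used to infer the direction of interaction. With short series the estimate is biased, so the values should only be compared, not read as absolute quantities.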

The transfer entropy quantifies the information flow from $J$ to $I$. It can be
calculated by subtracting the information about the latest observation of $I$
obtained from the last observation of $I$ only from the information about the
latest observation of $I$ obtained from the last joint observation of $I$ and
$J$. This is the main concept of the transfer entropy. Therefore, the transfer
entropy can be rephrased as

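$$ T_{J \to I} = h_I(k) - h_{IJ}(k, l), \tag{2} $$

where

$$ h_I(k) = -\sum p(i_{t+1}, i_t^{(k)}) \log p(i_{t+1} \mid i_t^{(k)}), \tag{3} $$

$$ h_{IJ}(k, l) = -\sum p(i_{t+1}, i_t^{(k)}, j_t^{(l)}) \log p(i_{t+1} \mid i_t^{(k)}, j_t^{(l)}). \tag{4} $$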
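The step from Eq. (1) to Eq. (2) is stated without derivation. As a sketch using only the definitions in Eqs. (1), (3), and (4), the logarithm of the ratio can be split into two sums, and the sum over $j_t^{(l)}$ can be carried out in the second one:

```latex
\begin{align*}
T_{J \to I}
 &= \sum p(i_{t+1}, i_t^{(k)}, j_t^{(l)}) \log p(i_{t+1} \mid i_t^{(k)}, j_t^{(l)})
  - \sum p(i_{t+1}, i_t^{(k)}, j_t^{(l)}) \log p(i_{t+1} \mid i_t^{(k)}) \\
 &= -h_{IJ}(k, l)
  - \sum_{i_{t+1},\, i_t^{(k)}}
    \Big[ \sum_{j_t^{(l)}} p(i_{t+1}, i_t^{(k)}, j_t^{(l)}) \Big]
    \log p(i_{t+1} \mid i_t^{(k)}) \\
 &= -h_{IJ}(k, l) - \sum p(i_{t+1}, i_t^{(k)}) \log p(i_{t+1} \mid i_t^{(k)}) \\
 &= h_I(k) - h_{IJ}(k, l).
\end{align*}
```

Because conditioning on additional variables cannot increase a conditional entropy, $h_{IJ}(k, l) \le h_I(k)$, so $T_{J \to I} \ge 0$ for the true distributions.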

3 Empirical data analysis 

We analyze daily records of the S&P 500 index (GSPC), the Dow Jones index
(DJI), and the stock prices of 125 selected individual companies. The dataset
consists of about 4,000 simultaneously recorded data points during the period
from June 1, 1983 to May 31, 2007. We use the logarithmic price difference,
defined as follows:

$$ x_n = \ln(S_n) - \ln(S_{n-1}), \tag{5} $$

where $S_n$ is the index or stock price on the $n$-th trading day. The first
step in the transfer entropy analysis is to discretize the time series by some
coarse graining. Quite often, statistical studies which use the entropy assume
that the variables of interest are discrete, or may be discretized in some
straightforward manner. We partition the real value $x_n$ into a discretized
price change $A_n$. Concretely, we choose $A_n = 0$ for $x_n < -d/2$
(decrease), $A_n = 1$ for $-d/2 < x_n < d/2$ (intermediate), and $A_n = 2$ for
$x_n > d/2$ (increase).
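The preprocessing described above, Eq. (5) followed by the three-state coarse graining, can be sketched as follows. The function names and the sample prices are hypothetical. Assigning returns that lie exactly on a boundary ($|x_n| = d/2$) to the intermediate state is an assumption of this sketch, since the text does not specify that case.

```python
import math


def log_returns(prices):
    """Logarithmic price differences x_n = ln(S_n) - ln(S_{n-1})."""
    return [math.log(b) - math.log(a) for a, b in zip(prices, prices[1:])]


def discretize(returns, d):
    """Map each return to 0 (decrease), 1 (intermediate), or 2 (increase).

    Returns exactly equal to -d/2 or d/2 are placed in the intermediate
    state; this boundary choice is an assumption of the sketch.
    """
    half = d / 2.0
    states = []
    for x in returns:
        if x < -half:
            states.append(0)
        elif x > half:
            states.append(2)
        else:
            states.append(1)
    return states


# Hypothetical closing prices, for illustration only.
prices = [100.0, 101.0, 100.9, 99.0, 99.0]
print(discretize(log_returns(prices), d=0.006))  # -> [2, 1, 0, 1]
```

The resulting state sequences for an index and a stock are the inputs from which the joint and conditional probabilities in Eq. (1) are estimated.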

When the data are discretized, it is important to determine the size of $d$,
because the probability of each state varies with $d$. For very small $d$, most
of the return values belong not to the intermediate state but to the increase
or decrease states, so the data can practically be regarded as a two-state
system. Conversely, when $d$ is very large, the greater part of the returns
falls into the intermediate state, so the data can be considered a one-state
system.

Fig. 1. Probability of states for (a) the GSPC, (b) the DJI, and (c) individual stocks
as a function of $d$, where (c) is the average probability for all individual stocks.

As the value of $d$, the range of the intermediate state, changes, the
probability of each state varies. Fig. 1 shows the probability of each state.
The probabilities of the increase and decrease states are almost the same. The
probability of the intermediate state increases as $d$ increases, while those
of the increase and decrease states are reduced. Around $d = 0.003$, the
probabilities of the three states are approximately the same for both composite
stock indices. On the other hand, individual stocks show equal probabilities at
$d = 0.006$. The value of $d$ that equalizes the probabilities is smaller for
the composite stock indices than for individual stock prices because an index
usually does not change as abruptly within a day as individual stocks do: a
composite stock index is an average or weighted average of individual stock
prices.
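The argument that a less volatile series reaches equal state probabilities at a smaller $d$ can be checked in a simple model. The sketch below assumes zero-mean Gaussian returns with standard deviation `sigma`. This is an assumption made only for illustration: the paper does not assume Gaussian returns, and real returns are not Gaussian, so the sketch does not attempt to reproduce the values $d = 0.003$ and $d = 0.006$. The two `sigma` values are hypothetical.

```python
import math


def state_probabilities(d, sigma):
    """(P(decrease), P(intermediate), P(increase)) for Gaussian returns.

    For x ~ N(0, sigma^2), P(|x| < d/2) = erf(d / (2 * sigma * sqrt(2))).
    """
    p_mid = math.erf(d / (2.0 * sigma * math.sqrt(2.0)))
    p_tail = (1.0 - p_mid) / 2.0
    return p_tail, p_mid, p_tail


def equal_probability_d(sigma, tol=1e-12):
    """Bisect for the d at which all three states have probability 1/3."""
    lo, hi = 0.0, 10.0 * sigma  # P(intermediate) is ~1 at the upper bound
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if state_probabilities(mid, sigma)[1] < 1.0 / 3.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical daily volatilities: a smoother index and a noisier stock.
for label, sigma in [("index", 0.01), ("stock", 0.02)]:
    print(label, "equal-probability d =", equal_probability_d(sigma))
```

In this model the equal-probability threshold is proportional to `sigma`, so doubling the volatility doubles the threshold. This is qualitatively consistent with the smaller value of $d$ found for the indices.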
Fig. 2. Mean value of the transfer entropy for (a) the GSPC and (b) the DJI as a
function of $d$: ■ for $T_{I \to S}$, ▲ for $T_{S \to I}$, □ for $T_{I \to S}$ of the
shuffled data, and △ for $T_{S \to I}$ of the shuffled data.

Fig. 2 shows the mean value of the transfer entropy between the composite stock
index ($I$) and the prices of individual stocks ($S$) for the GSPC and the DJI
as a function of $d$ with $k = 1$ and $l = 1$. The transfer entropy from the
stock index to the stock prices, $T_{I \to S}$, is higher than that from the
stock prices to the stock index, $T_{S \to I}$, over almost the whole range of
$d$. At $d = 0$, the discretized data fall into two states because the
intermediate state disappears. Therefore, the transfer entropy is smaller than
that for three states. As $d$ increases, the number of states becomes three,
and the transfer entropy is maximized around $d = 0.015$. Above the value of
$d$ that maximizes the transfer entropy, the larger $d$ is, the larger the
probability of the intermediate state. Moreover, above about 0.02, $P(1)$ for
the index approaches 1. Therefore, the transfer entropy decreases and finally
goes to 0, because all the data fall into the intermediate state at very large
$d$.

The open squares (□) and triangles (△) of Fig. 2 represent the transfer entropy
from shuffled data. As expected, the transfer entropy from the shuffled data is
smaller than that from the original data, and the difference between
$T_{I \to S}$ and $T_{S \to I}$ disappears below $d \approx 0.02$ and above
$d \approx 0.04$ for both indices. In the range from around 0.02 to around
0.04, the number of states for the indices is 1, while it is still 3 for
individual stocks. Therefore, this difference in the number of states triggers
the discrepancy of the transfer entropy between the indices and stocks.
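The comparison with shuffled data can be sketched as a surrogate test. The text does not say which series is shuffled or how many shuffles are averaged, so the choices here (shuffling only the source series, 20 trials, a fixed seed) are assumptions, as are the function names and the synthetic data. The estimator is a compact form of Eq. (1) for $k = l = 1$.

```python
import random
from collections import Counter
from math import log2


def transfer_entropy(target, source):
    """Plug-in estimate of T_{source -> target} in bits for k = l = 1."""
    n = len(target) - 1
    triples, pairs_ij = Counter(), Counter()
    pairs_ii, singles = Counter(), Counter()
    for t in range(n):
        nxt, cur, src = target[t + 1], target[t], source[t]
        triples[(nxt, cur, src)] += 1
        pairs_ij[(cur, src)] += 1
        pairs_ii[(nxt, cur)] += 1
        singles[cur] += 1
    return sum(
        (c / n) * log2((c / pairs_ij[(cur, src)])
                       / (pairs_ii[(nxt, cur)] / singles[cur]))
        for (nxt, cur, src), c in triples.items()
    )


def shuffled_transfer_entropy(target, source, trials=20, seed=0):
    """Mean transfer entropy after randomly permuting the source series.

    Shuffling destroys the temporal ordering of the source while keeping
    its state frequencies, so the result estimates the spurious level.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        surrogate = list(source)
        rng.shuffle(surrogate)
        total += transfer_entropy(target, surrogate)
    return total / trials


# Synthetic data: the target copies the source with a one-step delay.
rng = random.Random(1)
source = [rng.randint(0, 1) for _ in range(2000)]
target = [0] + source[:-1]
print("original:", transfer_entropy(target, source))
print("shuffled:", shuffled_transfer_entropy(target, source))
```

A value from the original data that is clearly above the shuffled level indicates information flow beyond what finite-sample effects alone would produce.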
Fig. 3. At $d = 0.015$, the frequency of $T_{I \to S}$ and $T_{S \to I}$ for (a) the
GSPC and (b) the DJI, and the frequency of the difference between $T_{I \to S}$ and
$T_{S \to I}$ for (c) the GSPC and (d) the DJI.



Figs. 3(a) and 3(b) show the frequency of the transfer entropy between the
composite stock index and the stock prices at $d = 0.015$. The frequency
distribution of the transfer entropy from the index to the stocks is more
skewed to the right than that from the stocks to the index. Figs. 3(c) and 3(d)
show the difference between $T_{I \to S}$ and $T_{S \to I}$. For the majority
of companies, the transfer entropy from the index to the stock is larger than
that in the reverse direction. However, about 35% of the companies give
information to the index of the next day.



Fig. 4. The relation between $T_{I \to S}$ and $T_{S \to I}$ for (a) the GSPC and (b) the DJI.



Fig. 4 shows the positive relation between $T_{I \to S}$ and $T_{S \to I}$. The
correlation between them is 0.51(9) for the GSPC and 0.40(9) for the DJI.
Table 1 lists the top 10 companies by transfer entropy. Among the top 10
companies, Xerox Corp., Entergy Corp., Consolidated Edison Inc., Centerpoint
Energy Inc., and PG & E Corp. belong to the top 10 for both $T_{I \to S}$ and
$T_{S \to I}$. Both Fig. 4 and Table 1 show that the higher $T_{I \to S}$ is,
the higher $T_{S \to I}$ is, though the average value of $T_{I \to S}$ is
higher than that of $T_{S \to I}$. Consequently, individual stocks can be
divided into stocks that are highly connected to the market and stocks that
are weakly connected to it.



4 Conclusion 



The concept of the transfer entropy has been proposed for finding the direction
of causality. Using this measure, we are able to investigate the information
flow between the stock index and individual stocks. Our results indicate that
there is a stronger flow of information from the stock index to the individual
stocks than vice versa, and that the transfer entropies in the two directions
are positively correlated. Moreover, we expect results similar to those for the
U.S. market for other stock markets. As a matter of fact, the information flow
in the Japanese stock market shows the same directional causality, although it
is not shown in this paper.

We would like to find correlations between the direction of information flow
and company profiles. However, we have not found any yet. The division of
individual stocks according to the direction of causality between the composite
stock index and companies may be useful for stock investment strategies.
Rank  GSPC → stock                 stock → GSPC
1     Pepsico Inc.                 Centerpoint Energy Inc.
2     FPL Group Inc.               Duke Energy Corp.
3     Xerox Corp.                  Xerox Corp.
4     Entergy Corp.                Bristol-Myers Squibb Co.
5     Consolidated Edison Inc.     International Business Machines Corp.
6     Walt Disney Co.              American Electric Power Co. Inc.
7     Union Pacific Corp.          PG & E Corp.
8     United Technologies Corp.    TXU Corp.
9     Clorox Co.                   Wyeth
10    Centerpoint Energy Inc.      Consolidated Edison Inc.

Rank  DJI → stock                  stock → DJI
1     Walt Disney Co.              Xerox Corp.
2     Consolidated Edison Inc.     Centerpoint Energy Inc.
3     Xerox Corp.                  Williams Companies Inc.
4     Whirlpool Corp.              Duke Energy Corp.
5     Pepsico Inc.                 Southern Co.
6     FPL Group Inc.               PG & E Corp.
7     Coca-Cola Co.                American Electric Power Co. Inc.
8     United Technologies Corp.    Honeywell International Inc.
9     Corning Inc.                 Entergy Corp.
10    PG & E Corp.                 Bristol-Myers Squibb Co.

Table 1

The top 10 companies of the transfer entropy.